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Response-adaptive randomization has recently attracted a lot of 
attention in the literature. In this paper, we propose a new and sim- 
ple family of response-adaptive randomization procedures that attain 
the Cramer-Rao lower bounds on the allocation variances for any al- 
location proportions, including optimal allocation proportions. The 
allocation probability functions of proposed procedures are discontin- 
uous. The existing large sample theory for adaptive designs relies on 
Taylor expansions of the allocation probability functions, which do 
not apply to nondifferentiable cases. In the present paper, we study 
stopping times of stochastic processes to establish the asymptotic ef- 
ficiency results. Furthermore, we demonstrate our proposal through 
examples, simulations and a discussion on the relationship with ear- 
lier works, including Efron's biased coin design. 

1. Introduction. Scientific and clinical investigations often demand that 
a new intervention or treatment be compared against a control group. Ran- 
domization is the preferred way of assigning participants to the control and 
the treatment groups. In some clinical trials, response- adaptive randomiza- 
tion procedures are desirable for ethical and efficiency reasons 
[Hu and Rosenberger (2006)]. The three main components of response-adaptive 
randomization are allocation proportion, efficiency (power), and variability. 

The development of response-adaptive randomization and optimal allo- 
cation proportion was briefly reviewed in the introduction of Hu and Zhang 
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(2004). In this paper, we denote optimal allocation proportions as alloca- 
tion proportions that are derived from certain optimality criteria (see ex- 
amples in Section 2). The issue of efficiency or power was discussed by 
Hu and Rosenberger (2003), who showed that the efficiency is a decreas- 
ing function of the variability induced by the randomization procedure for 
any given allocation proportion. More recently, Hu, Rosenberger and Zhang 
(2006) showed that there was an asymptotic lower bound on the variability 
of response-adaptive designs. A response-adaptive design that attains this 
lower bound will be said to be first-order efficient. Most response-adaptive 
randomization procedures in the literature are not first-order efficient. The 
only exception that is known to us is the drop-the-loser rule [Ivanova (2003)], 
but its applications are limited to urn allocation proportion (not generally 
optimal) and binary responses [see Hu, Rosenberger and Zhang (2006) for 
details] . 

In this paper, we address the open problem of finding an efficient and 
randomized procedure that can adapt to any desired allocation propor- 
tion (including optimal allocation proportions). Specifically, we propose a 
new family of efficient randomized-adaptive designs (ERADE) that is easy 
to implement in practice for both discrete and continuous responses. Un- 
der some mild conditions, we obtain the asymptotic normality and strong 
consistency of both the allocation proportions and the estimators of pop- 
ulation parameters. Our results provide a solid foundation for the efficient 
randomized-adaptive designs and the related statistical inferences based on 
such designs. Several advantages of the procedure are also demonstrated 
through examples. 

In the literature, asymptotic properties of adaptive designs are studied 
under continuous and differentiable allocation probability function by us- 
ing Taylor expansion. When the allocation probability function is discrete, 
the commonly used techniques do not work anymore. The allocation prob- 
abilities of the ERADEs are discrete functions. In this paper, we overcome 
the difficulties of discontinuity by introducing a stopping time of a mar- 
tingale process. This may provide a novel direction to study the properties 
of adaptive designs with discrete probability functions. Efron's biased coin 
design [Efron (1971)] is a special case of ERADE for balancing two treat- 
ments. Rosenberger and Lachin [(2002), page 49] showed, by simulation, 
that Efron's biased coin design is more efficient than adaptive biased coin 
designs proposed by Wei (1978) and Smith (1984). The asymptotic results 
in this paper show the efficiency of Efron's biased coin design theoretically. 

The paper is organized as follows. In Section 2, we propose a simple family 
of response-adaptive randomization procedures with some important asymp- 
totic properties. They are shown to attain the Cramer-Rao lower bound on 
the allocation variabilities. The efficient procedures are derived for several 
commonly used optimal allocation proportions. In Section 3, we conduct a 
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simulation study to examine the properties of the proposed designs. Section 
4 demonstrates the potential value of the ERADE through a careful anal- 
ysis of the ECMO trials. We conclude the paper with several remarks and 
potential further research directions in Section 5. Technical details are given 
in the Appendix. 

2. Efficient randomized-adaptive designs. We first describe the frame- 
work for the randomized-adaptive designs. 

General framework. We consider two-treatment clinical trials. Suppose 
that the patients come to the clinical trial sequentially and respond to treat- 
ments without delay. After the first m patients are being assigned to treat- 
ments and the responses observed, the (m -|- l)th patient will be assigned 
to treatment 1 with probability Pm+i and treatment 2 with probability 
1 — pm.+i- The probability Pm+i may depend on both the treatments as- 
signed to and the responses observed of the previous m patients. Let X^^k 
be the result of the mth assignment (i.e., Xm,k = 1 if the mth patient is 
assigned to treatment k, and otherwise). We assume that the patient 
responses with each treatment are i.i.d. with the probability distribution 
indexed by 0^. G (A; = 1,2). The cases of d=l (for binary response) or 
d = 2 (for normally distributed response) are typical. 

Let Nm,k = J2]Li Xj^k be the number of patients assigned to treatment 
k in the first m patients. We assume that the desired allocation proportion 
of patients assigned to each treatment is a function of G = {61,62) [see 
Rosenberger et al. (2001) for a related discussion]. More specifically, the goal 
of allocation is to have Nm.i/fn — > u = p{@), where p{-) iTZ'^^'^ (0, 1) is an 
allocation proportion. 

We further assume that the parameter estimate 6m,k of 0fc based on an 
m-patient study has the Bahadur-type representation 

m 

(2.1) 6„,,k = N-\ X,, fc^^. + o(iV^.f ) as m ^ 00 

i=i 

for some i.i.d. sequences of random variables {^j^fc, j = 1, . . .}, where is 
the response or a function of the response of the mth patient on treatment 
k. 

In most applications, the response distributions belong to an exponential 
family. Then, we take 0^ = E[^if.], where are the natural sufficient 

statistics, and ^m.fc is the average of the observed sufficient statistics. In 
practice, we may start with ©q = (^0,1)^0,2) as an initial estimate of and 
use 
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to ensure a well-defined estimator even when no patients are assigned to 
treatment k. The initial estimate 0o is a guessed value of or an estimate 
of from early trials. 

Efficient randomized- adaptive design (ERADE). To start, we assign uiq 
(usually mo = 2) subjects to each treatment by using restricted randomiza- 
tion. Assume that m (m > 2mo) subjects have been assigned in the trial, 
and their responses are observed. Let &m be the estimator defined in (2.1) 
based on the m observations, and pm = p{&m)- Then, our proposed ERADE 
assigns the (m + l)th patient to treatment 1 with probability 

( apm, if Nm,i/m > Pm, 

(2.3) Pm+l = lpm, ^fNm,l/m = Pm, 

[l- a{l -pm), if Nm,i/m<pm, 

where < a < 1 is a constant that reflects the degree of randomization. See 
Remarks 2.1 and 3.1 for details of choosing a. 

It is important to note that the allocation probability in (2.3) is a dis- 
crete function. In the literature, allocation probability functions are gener- 
ally continuous and differentiable [Hu and Rosenberger (2006)], which allows 
for Taylor expansions in the asymptotic analysis. We have to use a different 
approach to establish the asymptotic properties in the Appendix. 

Example 1 (Efron's biased Coin design). To balance patients in two 
treatment groups (with p = l/2 as the target), our proposed ERADE assigns 
the (m + l)th patient to treatment 1 with probability 

{a/2, if Nm,i/m>l/2, 

(2.4) pm+i = {l/2, if Nm,i/m = l/2, 

[l-a/2, if Nm,i/m<l/2. 

The special case of a = 2/3 is Efron's biased coin design [Efron (1971)]. 

Remark 2.1. Efron's biased coin design played a very important role 
in the randomization literature. This idea has been developed further by 
Pocock and Simon (1975), among others, for randomization procedures that 
depend on prognostic variables. However, it is difficult to study its asymp- 
totic properties because the allocation probability is not a continuous func- 
tion. To overcome the theoretical difficulty, Wei (1978) and Smith (1984) 
proposed adaptive biased coin designs and generalized biased coin designs 
by using continuous allocation probability. Burman (1996) introduced the 
expected p-value deficiency to evaluate the performance of a given design. 
Based on Burman's studies, it is reasonable to choose a between 0.4 and 0.7. 
Also, Baldi Antognini and Giovagnoli (2004) modified Efron's procedure by 
using more fiexible forms of probability functions. 
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Example 2 {Binary response). In the case of binary response, we de- 
note Pi = 1 — Qi and P2 = ^ — Q2, which are the success probabihties of 
treatments 1 and 2, respectively, and S^j^k {j = 1, . . . ,m and k = 1,2) as the 
corresponding responses. If Xj^^ is the treatment assignment indicator of 
jth patient, as defined in the general framework, then the natural choices of 
parameter estimates are 

To target p{Pi,P2) as the allocation proportion, let Pm = p{Pm,i, Pm,2)- 
Then, the ERADE assigns the (m + l)th patient to treatment 1 with prob- 
ability Pm+i, as defined in (2.3). We now list three specific allocation pro- 
portions: 

(i) The urn allocation proportion, given in Zelen (1969), Wei and Durham 
(1978) and Ivanova (2003), 

(2.5) VI = p{Pi,P2) = (1 - Pi)/(2 - Pi - P2); 

(ii) The optimal allocation proportion discussed in Rosenberger et al. 
(2001), 

(2.6) V2=p{Pl,P2) = ^/P[/{^l + y^)■, 

(iii) The Neyman allocation proportion discussed by Jennison and Turn- 
bull (2000), 



(2.7) V3 = p{Pi,P2) = ^/P[Q'i/{Vml+V^)- 

Example 3 (Gaussian response). We compare two treatments with re- 
sponses .^i~N(^i,ri) and ^2~N(/i2,T2), respectively. Let ^j^k, Xj,k and 
Nm,k be defined as in Example 2. Then, the maximum likelihood estimates 
are 

_Yl]LlXj^kCj,k ^2 _J2]Ll-^j,kiCj,k-IJ'm,k)'^ f, _ 1 O 

fJ'm,k — 77 ana r„ — — , k — 

^^m,k ^^m,k 

One can show that the MLEs satisfy the Bahadur-type representation in 
(2.1). We can target any given allocation proportion p{pi, p2,Ti,T2) by us- 
ing its estimator, pm = piP^m,!, fi'm,2,T'm,i,T'm,2), and the ERADE assigns the 
{m + l)th patient to treatment 1 with probability Pm+i as defined in (2.3). 
Three specific examples are as follows: 

(i) The optimal allocation proportion given by Zhang and Rosenberger 
(2006), 



(2.8) V4= p{pi,p2,Tl,T2 



nV/^ + T2^' 

when both pi and p2 are nonnegative; 
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(ii) The Neyman allocation proportion used by Jennison and Turnbull 
(2000), 

(2.9) V5 = p{fii,H2,Ti,T2) = — ■ — ; 

(iii) The D^-optimal allocation proportion (Gwise, Hu and Hu (2008)), 

^4/3 

(2-10) V6 = p{fIi,fl2,Ti,T2) = ^j^ JTg. 

^1 +^2 

Remark 2.2. In Example 2, V2 and are optimal allocation propor- 
tions, where V2 is based on minimizing the expected number of treatment 
failures for fixed variance of Wald test [Rosenberger et al. (2001)], and 
is based on minimizing the variance of the Wald test [Jennison and Turn- 
bull (2000)]. Also, vi is not an optimal allocation proportion, but it is the 
limiting proportion of the play~the-winner rule [Zelen (1969)] and the ran- 
domized play-the-winner rule [Wei and Durham (1978)]. In Example 3, V4, 
f 5 and vq are optimal allocation proportions from different optimal criteria. 

By the results of Hu and Rosenberger (2003), the asymptotic properties, 
especially the asymptotic variance of the allocation, play an important role 
in the efficiency of the design. Now, we turn to the asymptotic properties of 
the proposed ERADE. First, we state two conditions. 

Condition A. In the Bahadur-type representation (2.1), Ok = ^$.i^k 
and E||^i^fc||2 < 00 (A; = 1,2). 

Condition B. The proportion function y p{y) ■.TZ'^^'^ (0,1) is a 
continuous function and is twice differentiable at with p{@) =v. 

These two conditions are mild, and they are satisfied by all the allocation 
functions discussed in Examples 1-3. We now state the main asymptotic 
properties of the ERADE with the notation = Var(^i ^) (A; = 1, 2), and 

V = diagfivi,-^V2) and a'=(^ 
Theorem 2.1. Under Conditions A and B, we have, as n^oo, 




(2.11) 



op{\/n) and 

0{y/n log logn) a.s. 
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Further, we have 

(2.12) ^(0,^_0)^N(O,V), 

(2.13) V^{Nn,i/n-v)^N{0,a^), 

@n — = 0{\Jn~^ log logn) a.s. 

and 

(2.14) Nn,i — nv = 0{\/n log log ?7,) a.s. 

By Theorem 2.1 and the result of Hu, Rosenberger and Zhang (2006), 
we see that the proposed ERADE is indeed asymptotically efficient, so long 
as the parameter estimate 0„ is efficient. Let 1^ be the Fisher information 
matrix for parameter 6^ {k = 1,2). Then, the efficiency result is formally 
summarized in the following theorem. 



Theorem 2.2. Under Conditions A and B, i/Var(^2,fc) = > ^^^^ 
asymptotic variance of Nn^i/y/n for the ERADE attains the Cramer-Rao 
lower bound 



(2.15) 



dp 
dy 



@ 



dmg{{vh)-\i{l-v)l2r')^ 



& 



If the response distributions belong to the exponential family. Theorem 2.2 
establishes the efficiency of the ERADE for any a £ [0, 1). It also explains 
why Efron's biased coin design works well for balancing two treatments 
(p=l/2). 

Remark 2.3. Property (2.11) is a key component in the asymptotic 
analysis of the sample proportion i/n. It shows that the sample propor- 
tion can be approximated by an estimator pn of the target proportion p. 
In the literature, where the allocation probability is generally a continuous 
and differentiable function of pn and / or of the current sample proportion, 
the asymptotic properties of adaptive designs have been obtained by using 
Taylor expansion [cf., Wei (1978), Smith (1984) and Hu and Zhang (2004) 
among many others]. In those cases, the sample proportion cannot be well 
(efficiently) approximated by pn', therefore, the allocation variabilities do 
not attain the Cramer-Rao lower bound. When the allocation probability 
function is discrete, different techniques are needed and different asymptotic 
properties expected. We shall prove (2.11) by defining a stopping time of a 
martingale process to show that the difference between the sample propor- 
tion and the estimated target proportion in our model is small. 



8 



F. HU, L.-X. ZHANG AND X. HE 



Example 1 ( Continued) . Because /o = 1/2 is a constant, the lower bound 
is 0. That is, \/n{Nn^i/n— 1/2) — > in probabihty. Therefore, Efron's biased 
coin design is a higher order approximation to balance two treatments. This 
explains the fact that Efron's biased coin design provides better balanced 
results than the adaptive biased coin design [Wei (1978)] and the generalized 
biased coin design [Smith (1984)]. 

Example 2 (Continued). Both Conditions A and B are satisfied for 
binary response and the three allocations defined in (2.5), (2.6) and (2.7). 
By Theorem 2.1, we have, for the three allocations in (2.5), (2.6) and (2.7), 



with 
(2.16) 



V^{Nn,i/n-Vi)^'N{0,af), i = 1,2,3 

2^ QlQ2{Pl+P2)^ 
^1 (2-Pi-P2)3' 



(2.17) al- Q^^t'^Q.Pl' 



and 

(2.18) al 



4^/F^(^/7Y+/f^)3 

[PiQi)]i-5(l - 2P2)^ + [P2Q2]^-^(1 - 2Pi)^ 
4 VHQl7Wi[/FWr + 



By Theorem 2.2, the variances in (2.16), (2.17) and (2.18) attain the 
Cramer-Rao lower bounds in their respective allocations. 

Example 3 (Continued). By using = (Cfc, (^fc — Mfc)^)' as the responses, 
the conditions of Theorems 2.1 and 2.2 are satisfied. Thus, for the three al- 
locations in (2.8), (2.9) and (2.10), we have ^{Nn,i/n - Vi) ^ 'N{0,af) 
(i = 4, 5, 6) with 

(2.19) al - ^^"2^V7^ 



2(ri^//I^ + r2^/iI' 
(2.20) al = , ^^^^ 

and 

8(TlT2)^/3 



|2' 



(2.21) 



These variances in (2.19), (2.20) and (2.21) attain the Cramer-Rao lower 
bounds in their respective allocations. 
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Table 1 

Results for Case 1. Simulated and theoretical allocation proportions (Nn^i/n) are given 
for different designs with their variances nvar(A^„_i/n) given in the parentheses. The 
target allocation proportion is vi — {1 — P2)/(2 — Pi — P2). The simulation used 1000 

trials of n — 100 



ERADE with 

DBCD 

ERADE and DL a = 1/2 a = 2/3 DL with 7 = 2 



Pi, P2 Asymptotic Simulated Simulated Simulated Asymptotic Simulated 



0.9, 





7 





75 


(0.75) 





72 


(0 


69) 





72 


(0 


73) 





64 


(0 


39) 





75 


(0 


94) 





74 


(0 


91) 


0.9, 





6 





80 


(0.48) 





78 


(0 


49) 





77 


(0 


51) 





69 


(0 


34) 





80 


(0 


61) 





78 


(0 


58) 


0.9, 





5 





83 


(0.32) 





81 


(0 


34) 





80 


(0 


36) 





73 


(0 


24) 





83 


(0 


42) 





81 


(0 


39) 


0.9, 





3 





88 


(0.16) 





85 


(0 


17) 





85 


(0 


18) 





79 


(0 


14) 





88 


(0 


22) 





86 


(0 


21) 


0.8, 





8 





50 


(10.00) 





50 


(0 


75) 





50 


(0 


82) 





50 


(0 


51) 





50 


(1 


25) 





50 


(1 


16) 


0.8, 





7 





60 


(0.72) 





59 


(0 


65) 





59 


(0 


63) 





57 


(0 


43) 





60 


(0 


91) 





60 


(0 


80) 


0.8, 





6 





67 


(0.52) 





66 


(0 


52) 





65 


(0 


49) 





62 


(0 


35) 





67 


(0 


67) 





66 


(0 


63) 


0.7, 





5 





63 


(0.35) 





62 


(0 


35) 





62 


(0 


36) 





60 


(0 


30) 





63 


(0 


47) 





62 


(0 


45) 


0.7, 





3 





70 


(0.21) 





69 


(0 


20) 





69 


(0 


23) 





68 


(0 


18) 





70 


(0 


29) 





69 


(0 


28) 


0.6, 





4 





60 


(0.24) 





60 


(0 


24) 





59 


(0 


25) 





59 


(0 


23) 





60 


(0 


34) 





60 


(0 


31) 


0.5, 





5 





50 


(0.25) 





50 


(0 


22) 





50 


(0 


23) 





50 


(0 


21) 





50 


(0 


35) 





50 


(0 


33) 


0.5, 





2 





62 


(0.13) 





61 


(0 


13) 





61 


(0 


15) 





61 


(0 


13) 





62 


(0 


20) 





61 


(0 


20) 


0.4, 





3 





54 


(0.13) 





54 


(0 


13) 





54 


(0 


13) 





54 


(0 


13) 





54 


(0 


21) 





54 


(0 


20) 


0.2, 





2 





50 


(0.06) 





50 


(0 


06) 





50 


(0 


06) 





50 


(0 


07) 





50 


(0 


13) 





50 


(0 


12) 



Remark 2.4. In Theorem 2.1, the sample proportion Nn^i/n converges 
to V (target proportion) at the rate of The asymptotic results in 

Theorems 2.1 and 2.2 do not depend on a. This is because the allocation 
probability is a discrete function and the first order approximation does not 
depend on a. In practice, we need to choose a suitable a. We will discuss 
this in next section. 

3. Simulation study. We conducted Monte Carlo simulations to exam- 
ine the finite sample performance of the proposed ERADE. As shown by 
Hu and Rosenberger (2003), the efhciency (power) of a randomized design 
is a decreasing function of the variability of the design, so the variability 
of the allocation proportions is the main criterion in our investigation. We 
compared the doubly adaptive biased coin design (DBCD) of Hu and Zhang 
(2004), the drop~the-loser (DL) rule of Ivanova (2003), the randomized 
player-the-winner (RPW) rule of Wei and Durham (1978) and our proposed 
ERADE with a = 1/2 or 2/3. Here, we use a = 1/2 or 2/3, based on Bur- 
man's studies of Efron's biased coin design. Note that the DL rule and the 
RPW rule apply only to binary responses by targeting the urn allocation 
proportion, which is not optimal. On the other hand, the ERADE applies 
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Table 2 

Results for Case 2 with the target allocation proportion V2 = VPi/{\/^ + V^)- 
Simulated and theoretical allocation proportions (Nn^i/n) are given for different designs 
with their variances nvar(A^,i,i/n) given in the parentheses. The simulation used 1000 

trials of n — 100 



ERADE with 



Pi, 


P2 


ERADE 


a = 


1/2 


a = 


2/3 




with 7 


= 2 




Asymptotic 


Simulated 


Simulated 


Asymptotic 


Simulated 


0.9, 


0.7 


0.53 (0.02) 


0.53 


(0.02) 


0.53 


(0.03) 


0.53 


(0.07) 


0.53 


(0.07) 


0.9, 


0.6 


0.55 (0.03) 


0.55 


(0.03) 


0.55 


(0.04) 


0.55 


(0.08) 


0.55 


(0.08) 


0.9, 


0.5 


0.57 (0.04) 


0.57 


(0.04) 


0.57 


(0.06) 


0.57 


(0.09) 


0.57 


(0.11) 


0.9, 


0.3 


0.63 (0.09) 


0.64 


(0.12) 


0.63 


(0.13) 


0.63 


(0.15) 


0.64 


(0.18) 


0.8, 


0.8 


0.50 (0.02) 


0.50 


(0.02) 


0.50 


(0.03) 


0.50 


(0.07) 


0.50 


(0.07) 


0.8, 


0.7 


0.52 (0.02) 


0.52 


(0.03) 


0.52 


(0.03) 


0.52 


(0.08) 


0.52 


(0.08) 


0.8, 


0.6 


0.54 (0.03) 


0.54 


(0.04) 


0.53 


(0.04) 


0.54 


(0.09) 


0.54 


(0.09) 


0.7, 


0.5 


0.54 (0.05) 


0.54 


(0.05) 


0.54 


(0.06) 


0.54 


(0.10) 


0.54 


(0.12) 


0.7, 


0.3 


0.60 (0.09) 


0.60 


(0.12) 


0.60 


(0.13) 


0.60 


(0.16) 


0.61 


(0.21) 


0.6, 


0.4 


0.55 (0.07) 


0.55 


(0.08) 


0.55 


(0.09) 


0.55 


(0.13) 


0.55 


(0.15) 


0.5, 


0.5 


0.50 (0.06) 


0.50 


(0.07) 


0.50 


(0.08) 


0.50 


(0.13) 


0.50 


(0.14) 


0.5, 


0.2 


0.61 (0.17) 


0.62 


(0.19) 


0.61 


(0.22) 


0.61 


(0.25) 


0.62 


(0.30) 


0.4, 


0.3 


0.54 (0.12) 


0.53 


(0.14) 


0.54 


(0.16) 


0.54 


(0.19) 


0.54 


(0.25) 


0.2, 


0.2 


0.50 (0.25) 


0.50 


(0.28) 


0.50 


(0.33) 


0.50 


(0.35) 


0.50 


(0.48) 



to all types of responses and can be used to target any desired allocation 
proportion. 

Case 1. The data were generated from the binary responses based on 
Example 2. First, we consider the urn allocation proportion in (2.5), and 
the results are given in Table 1. By Hu and Rosenberger (2003), the RPW 
rule has much higher variability than the DBCD, so we do not include the 
results of the former here. For the ERADE, niQ = 2 and a = l/2ora = 2/3 
are considered. For the DL rule, we start with 5 type 1 balls, 5 type 2 balls 
and 1 type ball. For the DBCD, we use the allocation function of Hu 
and Resenberger (2003), with 7 = 2. The results in Table 1 are based on 
1000 simulated trials with sample size 100. Prom Table 1, we see that the 
ERADEs with q = 1/2 and a = 2/3 performed better than other two pro- 
cedures in general. The DBCD had larger variances than other procedures. 
Both the ERADE and the DBCD attained the target allocation proportion 
quite accurately, and the simulated variance agreed with their correspond- 
ing asymptotic variances quite well, except Pi = P2 = 0.80. When Pi and P2 
were large and different, the DL rule did not converge to the target allocation 
proportion as fast as other procedures. However, its finite-sample variances 
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were much smaller than the corresponding asymptotic variances. When both 
Pi and P2 were small, both the DL rule and the ERADE performed well. 

Case 2. When the target allocation is the optimal allocation proportion 
in (2.6), the DL rule no longer applies. The comparison between the ERADE 
and the DBCD are given in Table 2. Both designs converged well to the op- 
timal allocation proportion, but the ERADE performed better with much 
smaller variability. Similar numerical results were obtained for the Neyman 
allocation proportion in (2.7), which are not reported here. We also consid- 
ered the cases with sample sizes 50 and 200, but only to reach the same 
conclusions. 

Remark 3.1. It is easy to see that a is related to the randomness of the 
design. When a is smaller, the ERADE is more determined and could have 
smaller variability. To study this, we have run simulations (both Cases 1 
and 2) for a = 1/8, 1/4, 1/2, 2/3 and 3/4 with different sample sizes n = 50, 
100 and 200. We found that the simulated results of a = 1/8 and 1/4 are 
very similar to the results of a = 1/2 in terms of allocation proportion and 
its variability. The ERADE with a = 3/4 has a slightly larger variability 
than others. Therefore, it is reasonable to recommend choosing a between 
0.4 and 0.7. This agrees with Burman's (1996) study of biased coin designs. 

Remark 3.2. The simulated results in Tables 1 and 2 are based on bi- 
nary responses from Example 2. We have also conducted some Monte Carlo 
simulations based on continuous responses from Example 3. We considered 
the doubly adaptive biased coin design (DBCD with 7 = 2) of Hu and Zhang 
(2004) and our proposed ERADEs with a = 1/2 and 2/3. We obtained sim- 
ilar results as reported in Table 2 with sample size n = 100. All designs con- 
verged well to the target allocation proportion, but the ERADE performed 
better with smaller variability. 

4. Analysis of ECMO trials, where ERADE would make a difference. 

Extracorporeal membrane oxygenation (ECMO) is an external system for 
oxygenating the blood based on techniques used in cardiopulmonary bypass 
technology developed for cardiac surgery. In the literature, there are three 
well-documented clinical trials on the evaluation of the clinical effectiveness 
of ECMO. These are the Michigan ECMO study [Bartlett et al. (1985)], the 
Boston ECMO study [Ware (1989)] and the UK ECMO trial [UK Cohabo- 
rative ECMO Trials Group (1996)]. The Boston ECMO study is based on a 
two-stage design, so we restrict our discussion to the other two studies here. 

In the Michigan ECMO study, the randomized player-the-winner rule 
was used to allocate patients to two treatments. Out of 12 patients, only 
one infant received conventional therapy. The first infant was assigned to 
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ECMO and survived. The second was assigned to conventional therapy and 
died. Ten subsequent infants were randomized to ECMO and all survived. 
This trial provided very little information about survival rates in the same 
population treated with conventional therapy, because only one infant was 
assigned to the conventional therapy. It turned out that this patient was the 
most severely ill patient in the study. The high variability of the randomized 
player-the- winner rule was responsible for the failure of the Michigan ECMO 
study. 

The UK ECMO trial used randomized allocation with equal proportions, 
and there were 93 patients in the ECMO and 92 in the conventional treat- 
ment for a total of 185. Prior to discharge from hospital, there were 28 deaths 
in the ECMO treatment and 54 deaths in the conventional treatment. We 
will use Pi =65/93 and P2 = 38/92 as the estimated success probabilities of 
the ECMO and the conventional treatment, respectively. Taking those val- 
ues as the success probabilities, we discuss what will happen if the ERADE 
is used in such cases. If the urn allocation is the target allocation, and the 
ERADE (with a = 1/2, ttiq = 2) is used for treatment allocation, then there 
will be about 121 patients in the ECMO and 64 patients in the conventional 
treatment, on average. Based on 10,000 simulated trials, the variability 
is 0.28, and the expected number of deaths is 74 death, as compared to 82 
in the actual trial. The adaptive design utilizes the better treatment more 
often to save lives. 

Next, we compare the variability and power of the ERADE and the RPW 
rule under the settings of Pi = 65/93 and P2 = 38/92. The expected power 
under both designs is 0.969. Because they are randomized procedures, the 
total number of patients assigned to the ECMO group is a random vari- 
able. Based on the 10,000 simulated trials, we noticed that in 99 percent 
of the trials under the ERADE there were more than 52 patients assigned 
to the conventional treatment group, for a power of 0.941 or higher. Under 
the RPW rule, only 39 or fewer patients were assigned to the conventional 
treatment in 1 percent of the trials, for a power of 0.904 or less. Also, based 
on the 10,000 simulated trials, the ERADE always assigns more patients to 
the ECMO group; however, the RPW rule assigned more patients to the 
conventional group in 114 trials. The number of patients assigned to the 
ECMO group for the ERADE and the RPW are summarized in the box 
plots of Figure 1. Even at the sample size 185, we can see the advantage of 
using the proposed ERADE over the randomized player-the-winner rule. 

In practice, one may modify the RPW by following changes: (i) first assign 
mo (say mo = 2) subjects to each treatment by using restricted random- 
ization; (ii) update the urn based on their (first 2mo subjects) responses 
according the RPW rule (add one same type ball if a success, add one op- 
posite type ball if a failure); (iii) use the RPW rule after the first 2mo 
subjects. For a small size clinical trial (e.g., the Michigan ECMO study), 
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Fig. 1. Box plots for the number of patients assigned to the ECMO group based on 10,000 
simulated trials. 



the proposed modification will reduce the variability of the RPW rule. For 
relatively large size trial (e.g., the UK ECMO trial), the modification with 
a small mo (say mo = 2) will not reduce the variability significantly. How- 
ever, the modification with a large mo is not recommended, because it will 
push the RPW rule to balance the allocation proportion. Also, this modifi- 
cation does not change the asymptotic properties of RPW. It is worthwhile 
to point out that urn models usually have much larger asymptotic variability 
than their corresponding ERADEs (with the same limiting allocation pro- 
portion) by comparing Theorem 2.1 with the results of Bai and Hu (2005) 
and Zhang, Hu and Cheung (2006). 

5. Conclusion remarks. We have proposed a family of response-adaptive 
designs that are fully randomized and asymptotically efficient. The ERADE 
can be viewed as a generalization of the Efron's biased coin design for any 
desired allocation function, which may depend on the unknown parameters. 
The asymptotic properties derived here provide the theoretical foundation 
for inferences based on the ERADE. They also help to understand why 
Efron's biased coin design is efficient for balanced trials. The examples in 
Sections 3 and 4 provide a guideline for the use of the proposed ERADE in 
practice. 



14 



F. HU, L.-X. ZHANG AND X. HE 



In this paper, we have assumed that the responses in each treatment 
group are available without delay. In practice, there is no logistical difficulty 
in incorporating delayed responses into the ERADE, provided that some 
responses become available during the course of the allocation in the experi- 
ment; thus, we can always update the estimates whenever new data become 
available. For clinical trials with uniform (or exponential) patient entry and 
exponential response times [see, e.g., Bai, Hu and Rosenberger (2002)], it is 
easy to verify that both Theorems 2.1 and 2.2 remain true. 

In Theorem 2.1, we assumed that the parameter estimate has Bahadur- 
type representation as in (2.1). This representation is usually satisfied in 
applications (e.g., moment estimation, maximum likelihood estimation, es- 
timate based on estimating equation, etc.). Theorem 2.1 ensures the con- 
sistency and asymptotical normality of the parameter estimate after using 
ERADE in clinical trial. Baldi Antognini and Giovagnoli (2005) have con- 
sidered the properties of maximum likelihood estimation for some other 
response-adaptive designs. Also, we have focused on ERADE for two treat- 
ments. Recently, Tymofyeyev, Rosenberger and Hu (2007) proposed a gen- 
eral framework to obtain optimal allocation proportions for comparing two 
or more treatments. How to generalize ERADE to i^T-treatment case is a 
new research topic. 

In some clinical trials, it is important to take covariates into consideration. 
To balance known covariates in two treatment groups, Pocock and Simon 
(1975) proposed a covariate-adaptive randomization procedure by using 
Efron's biased coin design. Since an important goal of clinical trials is to 
estimate a treatment effect, one may assign subjects to treatments according 
to their covariates for efficiency [Atkinson (1982)]. In some clinical trials, one 
may assign subjects to treatments according to their covariates for ethical 
reasons [see, e.g., Zhang et al. (2007)]. The ERADE could play an impor- 
tant role in this area; further research beyond the work in the present paper 
is clearly needed to account the effect of covariates in response-adaptive 
designs. 



To prove the main theorems, we first prove the following lemma. 

Lemma A.l. Suppose {Mn,J-n'in > 1} is a martingale with E|AM„p < 
Co, where AMn = Mn — Mn~i and {J-n} is a filter of sigma fields with 
J^i C C ■ ■ ■ . Then, for any L > 4, 



APPENDIX: PROOFS 



(A.l) 



E max\Mn-M, 



m<L 



n—m 



(A.2) 
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Proof. It is easy to show (A.l). For (A. 2), we have 

\Mn-Mn-n. 



E 



max 

m:L<m<n TTl 

< E E 

j:log2L<i<log2n 



Mn-Mn-m. 

max ^-j 



□ 



Proof of Theorem 2.1. Let J^m = cr{Xi^i,...,Xm,i,$i,---,tm) be 
the sigma field generated by previous m stages. Then, under Tm-x-, 
and are independent, and E[Xm,i|J'>n--i] = Pm- Let M„ = J2m=i^^m, 
where AM„ = X^ i — E[Xm,i\^m~i]- Then, {AMm;m > 1} is a sequence of 
bounded martingale differences with |AMm| < 1, and 



A^n,l = ^n-l,l+Pn + AM„. 



Write 

Then, 
(A.3) 



n-l 



m=Q 



Nn,l - npn < iVn-1,1 - (n - l)pn_l + AUn 

if iV„_i,i - (n - l)p„_i > 0, 



where AC/„ = Un — Un-i- Let I = ln = max{m : 2mo + 1 < m < n, A^^.i — 
fnpm < 0}, where max0 = 2mo. According to (A.3), 

Nn,l - npn < AUn + AUn-1 + ■■■ + AUi+i + Ni^i - Ipi 
= Un-Ui + Ni^i - Ipi. 

Notice that Ni^i — Ipi < ii I > 2mQ + 1, and A'^; i — Ipi < 2mo if I = 2mQ. So, 
in either case 

(A.4) Nn,i - npn <Un-Ui + 2mo. 

We will show that 



(A. 5) Un-Ui^ <op{y/n) and [/„ - Ui^ < 0{^/nloglogn) a.s. 
If (A.S) is proved, then 

Nn,i - npn<op{^/n) and iV„,i - npn < 0(v/raloglogn) a.s., 
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and, by symmetry, 

(n - A'^^,!) - n{l - pn) < op{y/n), 

{n - Nn,i) - n{l - pn) < 0{^/n\oglogn) a.s. 

(2.11) is proved according to (A. 4) and (A.S). 

For proving (A. 5), we first show that Nn^i/n — > v a.s. If A^^^i — > c3o (resp. 

An, 2 = n — Nn^i — > oo), then — > 6i (resp. 0„.2 — > O2) by Lemma A. 4 of 
Hu and Zhang (2004). If sup„ A'„^i < 00 [resp. sup„(n — A^„,i) < 00], then 
the value of On,i (resp. 0n,2) will not change when n is large enough. In 
either case, 6n,i and 0ri,2 have finite limits. It follows that there is a it with 
< < 1, such that 

(A. 6) pn^ u a.s. 

by the continuity of p{-). On the other hand, according to the law of large 
numbers for martingales, 

(A.7) Mn = o{n) a.s. 

Combining (A. 6) and (A.7) yields C/„ ~ — n(l — a)u a.s. And then, by (A. 4), 
{Nn,i/n - pn)+ a.s. Similarly, ((1 - A^nj/n) - (1 - p„))+ ^ a.s. It 
follows that Nn.i/n — pn^ a.s., which, together with (A. 6), implies 

(A.S) lim — — = lim pn = u a.s. 

n— >oo fi n— >oo 

On the other hand, (A.S) implies that A^„^i — > 00 and n — Nn^i — > 00. By 
Lemma A. 4 of Hu and Zhang (2004), again, we have 0^ — > a.s. and 



(A.9) e.-S = 0[,M^) a.s. 



n 

So, the limit u in (A. 6) and (A.S) must be f = p{@), according to the 
continuity of p(-). 
Now, 

n-l 

Un-Ui = -{n a)v + ^ a{pm -v)-{n- l){pn - v) 

m=l 

(A.IO) + Mr, - Ml + l{pi - Pn) 

= {n- - a)v + 0(1)] +Mn-Mi + l{pi - pn) a.s. 

Also, according to (2.1) and the fact that Nn^i/n — > v a.s., it is easily shown 
that ^/n{@n — 0) = Op(l). So, we have 

dp 
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From (2.1), it follows that 

m-Pn)\<Cl\\&n-®l\\+Op{l) 
2 



< 



fc=i 



I 



n.k 



m=l+l 



k=l 



l\Ni^k - Nn,k\ 



1 



Ni,k 



E ^rn,k{im,k " ^fc' 



Op(^/n) 



= o(l) • (n - + CIIQn - QHI + op(\^), 

where Qn,k = Y.m=i^ni,k{^m,k~ ^k) and Q„ = (Qn,i,Qn,2), A; = 1,2. Notice 
that {Qn} and {M„} are both martingales. Notice that, for any l<L<n, 
we have 

I Qn Qn— rr? 



IQn — Qdl ^ ~ ■ ™ax ■ 

L<m<n 



m 



■ max ||Q„ — Qr 

m<L 



Now, choose L = L{n) — > oo with L = o{n). Then, by Lemma A.l, 

IQn-Q n—m 



max ■ 

L<m<n 



m 



and 

It follows that 
Similarly, 



max||Q„ - Q„_ 



= Op{^l/L) = op{l) 
Op{Vl)<op{V^). 



IIQn - Qill = • {n- I) + op{^/n). 

\Mn -Mi\= op{l) • (n - + op{^/^). 



We conclude that 

Un-Ui<{n- - a)v + op{l)] + \Mn - Mi\ + C||Q„ - Qi\\ + op(V^) 

< (n - - a)v + op(l)] + op(Vra) = op{^/^). 

The first part of (A. 5) is proved. For the second part, notice that 

Mn = 0(Vraloglog n) a.s. 

due to the law of iterated logarithm for martingales, and 

(A. 13) n(yO„ — v) = 0(-\/nloglogn) a.s. 

due to (A. 9) and (A. 11). The second part of (A.S) is proved by (A. 10). 
Finally, from Lemma 1 of Hu, Rosenberger and Zhang (2006), we have 

(A.14) V^(0„-0)^N(O,V). 

Together with (2.11) and (A. 12), (A.14) yields (2.13). Combining (2.11) and 
(A.13) yields (2.14). □ 
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